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Abstract. This contribution extends the Bron Kerbosch algorithm for 

solving the maximum weight clique problem, where continuous-valued 
weights are assigned to both, vertices and edges. We applied the proposed 
algorithm to graph matching problems. 

1 Introduction 

Comparing structural variations of two graphs is a fundamental task in pattern 

recognition, which finds its appHcations in diverse areas such as computer vision, 
bioinformatics, and computational chemistry. 

Since the graph matching problem is well-known to be NP-hard, there is 
an ongoing research on devising optimal and approximate graph matching algo- 
rithms. One popular technique consists in transforming graph matching to an 
equivalent clique search in a derived auxiliary structure, called association graph 
[1-3,9-11; 13]. In [6], it has been shown that a broad range of graph matching 
problems can be reduced to the maximum weight clique problem. Examples 
include divers graph distances and matching problems such as the graph edit 
distance, dissimilarities based on the maximum common subgraph, hierarchical 
tree matching, geometric graph distance functions, and many-to-many graph 
matching problems, to mention a few. 

A maximum weight clique is a clique of a weighted graph with maximum sum 
of vertex and edge weights. The problem is that clique search algorithms are only 
well investigated for graphs without weights or graphs with vertex weights, only. 

In this paper, we present an extension of the Bron-Kerbosch algorithm [4] 
for solving the maximum weight clique problem where the underlying graph has 
continuous- valued weights assigned to both, vertices and edges. We present und 
discuss first experiments. 

2 Preliminaries 

2.1 Attributed Graphs 

Let ^ be a set of attributes and let e G ^ be a distinguished element denoting 
the null or void element. An attributed graph is a tuple X = {V, a) consisting of 
a finite nonempty set V of vertices and an attribute function a : V x V ^ A. 



Elements of the set E = ^VxV : i ^ j and a{i,j) ^ e} are the edges 

of X. 

In this definition, attributes assigned to vertices i € V are given by a{i, i) and 
edges are characterized by pairs of distinct vertices that have non-null attributes. 
The vertex set of an attributed graph X is often referred to as Vx and its 
attribute function as ax- 

A subgraph of X is a graph Y with vertex set Vy C Vx and attribute function 
ayihj) <= {oix(i,j),£} for all i,j S Vy. We write Y C X to denote that y is a 
subgraph of X. An induced subgraph of X is a subgraph Y (~ X with attribute 
function ay = a|w. We write X[VV] to denote the subgraph of X induced by 
the vertex set Vy. 

A graph is said to be complete if all of its vertices are mutually connected 
by an edge. A clique of a graph X is a subset C C Vx such that the induced 
subgraph X[C] is complete. A clique C of X is said to be maximal if C is not 
contained in any larger clique of X. A maximum clique is a clique of X with 
maximum cardinality of vertices. 

The set N{i) = {j e Vx ■ e E} defines the set of all vertices of X 
adjacent to i € Vx- Note that the set N{i) excludes vertex i. The number 
deg(i) = \N{i)\ is the degree of vertex i e Vx- 

Suppose that y is a subgraph of X. The deletion of F in X is defined by the 
graph Z = X — Y with vertex set Vz = Vx and attribute function 

I- .X fe : e Vy 

\oix{i,3) '■ otherwise 

Let X and Y be graphs. A partial morphism from X to y is a partial injective 
mapping 

(l):Vx^Vy, i^ i't'. 

By T>{4>) C Vx we denote the domain oi (j) - A morphism is a partial morphism 
(j) which can not be extended to a partial morphism (p' with larger domain, that 
is T>{(f)) C ^{(f)'). By A4{X, Y) we denote the set of all morphisms from X to Y. 



2.2 The Maximum Weight Clique Problem 

A weighted graph is a graph Z = {V, a), where the underlying attribute set is of 
the form A = MU{e}. An unweighted graph is a weighted graph Z with attribute 
set of the form A = {0, 1} and an attribute function az that assigns each vertex 
the value and each edge the value 1 as its attribute. Thus, the distinguished 
null attribute s is represented by the value 0. 

Suppose that i e Vz is a vertex of a weighted graph Z. The weighted degree 
of i is defined by 



wdegzii) = az{i,i) + ^ az{i,j). 



Note that in the case of unweighted graphs the notion of degree and weighted 
degree coincide. 

The weight of a cHque C of Z is defined by 

u;{C)= J2 "(^'i) 
ijec 

The weight of a clique C is the total sum of all vertex and edge weights of the 
induced subgraph Z[C]. Since the vertices of Z[C] are mutually adjacent, the 
null attribute s does not occur in the definition of u){C). 

A maximum weight clique problem is a combinatorial optimization problem 
of the form 

maximize a;(C) 
subject to C € C{Z), 

where C{Z) is the set of all cliques of Z. Any solution of the maximum weight 
clique problem is a maximum weight clique of Z. A maximal weight clique of Z 
is a clique C of Z such that 

CCC' ^ uj{C) > uj{C') 

for all cliques C of Z. It is impossible to enlarge a maximal weight clique C 
to a clique C with higher weight. If all vertices and edges of Z are associated 
with positive weights, a maximal weight clique is not a proper subset of another 
clique. 

2.3 Graph Matching as Clique Search 

To measure the structural variation of two graphs, we consider the following 
(indefinite) graph kernel 

fc(X,F)= max kA{ax{i,j),aY(itf)), 
4>eMix,Y) , 

where kj( : A y. A ^ R+ is a positive definite kernel defined on the set A of 
attributes. The graph kernel k{X, Y) induces the notion of a well-defined length 
of a graph X by 

1{X) = ^k{X,X). 

As shown in [7] , the graph kernel together with the length of a graph satisfy the 
Cauchy-Schwarz inequality 

\k{X,Y)\<l{X)-l{Y). 

Suppose that X and Y are two attributed graphs. We transform the problem 
of computing the graph kernel k{X, Y) to the maximum weight clique problem 



Algorithm 1 (Basic Bron-Kerbosch for enumerating all maximal weight cliques) 



01 call: 6fe(0, V,0) 

02 bk{C,P,S) 

03 if P = and S' = then 

04 report maximal weight clique C 

05 for each vertex i £ P do 

06 hk{cyj{i},Pr\N{i),sr\N{i)^ 

07 P = P\{i} 

08 S = S\J{i} 



of an association graph of X and Y . An association graph Z = X ®Y oi X and 
Y consists of a vertex set Fz = Vx x W and an attribute function of the form 

az-Vz^Vz-^ R, {{i,j),{r,s)) ^ j), (r, s)). 

As shown in [6] , there is a one-to-one correspondence between the optimal solu- 
tions of the graph matching problem and the maximum weight cliques of Z. 

3 Extension of Bron-Kerbosch to the MWCP 

3.1 Basic Bron-Kerbosch 

Extension of the Bron-Kerbosch algorithm from enumerating all maximal cliques 
of an unweighted graph to enumerating all maximal weight cliques of a weighted 
graph is straightforward, since the notions of maximal clique and maximal 
weighted clique coincide for graphs with positive weights. Algorithm 1 out- 
lines the standard Bron-Kerbosch procedure for enumerating the maximal weight 
cliques of a given graph Z = {V,a). The algorithm operates on three disjoint 
subsets C, P, and S of vertices from V. The set C contains the vertices belonging 
to the current clique. Set P maintains all prospective vertices, each of which is 
connected to all vertices of C. Vertices from P are used for expanding the cur- 
rent clique C. Finally, the set S contains all vertices that can no longer be used 
for completion of C, because all maximal cliques containing these vertices have 
already been reported. The Bron-Kerbosch algorithm is called with C = S = $ 
and P = V. 

3.2 Bron-Kerbosch with Pivoting 

In the case of unweighted graphs, the standard Bron-Kerbosch procedure de- 
scribed in Algorithm 1 is inefficient in the case of graphs with many non-maximal 
cliques. Bron and Kerbosch [4] introduced a variant of the standard algorithm 



involving a pivot vertex ip chosen from P.^ Any maximal clique of S either in- 
cludes the pivot vertex ip or one of the vertices i € P \ N{ip) not adjacent to 
ip. Therefore, only the pivot vertex ip and vertices from P not adjacent to ip 
need to be considered as expansions of the current clique R in each recursive 
call of the Bron-Kerbosch algorithm. Vertices i from P adjacent to ip can be 
skipped, because any clique containing i must also contain ip. Such a clique will 
be discovered in a subsequent recursive call once ip has been added to C. Al- 
gorithm 2 presents the Brou-Kerbosch procedure with pivoting for enumerating 
all maximal weight cliques of Z. 



Algorithm 2 (Bron-Kerbosch with pivoting) 

01 call: 6fe(0,y,0) 

02 bk{C,P,S) 

03 if P = and 5 = then 

04 report maximal weight clique C 

05 choose pivot vertex ip £ P 

06 for each vertex i £ P \ N(ip) do 

07 bk(cu{i},PnN{i),snN{i)'j 

08 P = P\{i} 

09 S = SU{i} 



The challenge of Bron-Kerbosch with pivoting consists in finding good pivot 
selection strategies. In the case of unweighted graphs different strategies have 
been suggested (sec e.g. [?]). For the more general case of weighted graphs, we 
suggest the following pivot selection strategies: 

1. Random selection: 

The pivot vertex ip is randomly chosen from the set P. 

2. Max-weighted degree selection: 

The pivot vertex ip is chosen from P according to the rule 

wdeg2[cuP](v) > wdcg2[c'LJP](*) 

for all i E P, where the weighted degree is taken with respect to the subgraph 
Z[CUP] induced by the vertices of C U P. 

3. Max-weight clique selection: 

The pivot vertex ip is chosen from P according to the rule 

a;(cu{ip}) > w(cu{i}) 

for all i G P. 

^ As shown by [8], the pivot vertex can be more generally chosen from PL) S. We do 
not consider this case here. 



3.3 Bron-Kerbosch for Solving the MWCP 



Often it is sufficient to report a single maximum weight clique rather than enu- 
merating all maximal weight cliques. In this case, we modify Algorithm 2 by 
recording the maximal weight clique i?* with maximum weight found so far. To 
improve efficiency, we introduce a function h : C{Z) — >■ M_|_ with the following 
property: 

VC, C e C{Z) : CCC' ^ w(C) + h{C) > uj{C') 

Similarly, as in the y4.*-algorithm, the function h estimates the total weight ob- 
tained by expanding the current clique to a maximal clique. We demand that 
uj{C) + h{C) always overestimates the total weight of any clique containing C . 

Algorithm 3 outlines the Bron-Kerbosch procedure for solving the MWCP. 
The set C, is a global variable which can be initialized by the empty set or an 
arbitrary clique of Z. 



Algorithm 3 (Bron-Kerbosch for Solving the MWCP) 

01 initialize C* 

02 call: 6fe(0,\/,0) 

03 hk{C,P,S) 



04 if P = and 5 = then 

05 if uj{C) > a;(C.) then 

06 C, = C 

07 if a;(C)-F/i(C) >a;(a) then 

08 choose pivot vertex ip € P 

09 for each vertex i G P\ N{ip) do 

10 hk(^cu{i},PnN{i),snN{i)J 

11 P = P\{i} 

12 S = SU{i} 



Besides finding a good strategy for selecting the pivot vertex, a challenge for 
improving the efficiency of Algorithm 3 consists in formulating a good estimate 
function h. We suggest the following estimate functions h for the maximum 
weight clique problem of Z. 

1. Max-weight degree estimate (deg): 

ft-dog : -> M_|_, C H^- maxwdeg(i) 

2. Cauchy-Schwarz estimate (cs): 

We assume that Z = X (g) y is an association graph of X and Y. Suppose 



data set 


# (classes) 


avg(nodes) 


max(nodes) av; 


?(edges) 


max(edges) 


letter 


15 


4.7 


8 


3.1 


6 


grec 


22 


11.5 


24 


11.9 


29 


coil 


100 


8.3 


26 


14.1 


48 


molecules 


2 


24.6 


40 


25.2 


44 



Table 1. Summary of main characteristics of the data sets. The graphs were taken 
from the respective training sets. 



that Xq C X and Yc QY are the induced subgraphs obtained by projecting 
the current clique C to the factor graphs X and Y. Let Xc = X — Xq and 
Yc = Y — Yc denote the graphs obtained by deleting Xc in X and Yc in 
Y. Then we have 

\k{Xc,Yc)\<l{Xc)-l(Yc) 
according to the Cauchy-Schwarz inequality. Thus, the estimate function 

hcs : C(Z) ^ M+, C^l (Xc) ■ I (Yc) 

overestimates the remaining weights of any maximal weight clique of Z con- 
taining C. 

The deg-estimate is more general than the cs-estimate and can be applied to 
the generic maximum weight clique problem. In contrast, the cs-estimate is only 
applicable for graph matching problems that calculate geometric graph distance 
functions, that is graph distance functions that are maximizers of inner products. 
Since h^eg < hcs, our choice is the cs-estimate in case of its applicability. 

4 Experiments 

We applied the extended Bron Kerbosch algorithm to the problem of graph 
matching in order to assess its performance and to investigate its behavior. The 
aim is to investigate the effects of different pivoting strategies and compared 
the matching performance of Bron Kerbosch with the graduated assignment 
algorithm [5]. 

Data. For this, we selected the following data sets from the lAM graph database 
repository: letter, grec, coil, and mutagenesis. We used the whole training sets 
of the letter and grec. For coil and mutagenesis, we considered the first 200 
graphs of the respective training sets. Table 1 provides a summary of the main 
characteristics of the data sets. For further details we refer to [12]. 

Protocol. For each data set, we computed the pairwise similarities 
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Fig. 1. Results of Graduated Assignment (ga) and the 4 Bron Kerbosch (bk). Shown are 
the average similarity and computation time as a function of the number of recursions. 



For calculating pairwise similarities, we applied four variants of Bron Kerbosch. 
The variants differ in the choice of the following pivoting strategies: basic (no piv- 
oting), random selection, w-deg selection, and clique selection. All four variants 
of Bron Kerbosch used the Cauchy-Schwarz estimate. We recorded the average 
similarity and computation time of the different variants of Bron Kerbosch after 
a recursive calls, where 

a e {2, 5, 10, 50, 100, 500, 1 000, 5 000, 10 000, 50 000} . 

Results. Figure 1 summarizes the results. From the plots we see that Bron Ker- 
bosch with pivoting (random, w-deg, clique selection) is on average faster and 
scales better with problem size than Bron Kerbosch without pivoting (basic). 
This behavior is in line with findings of the standard Bron Kerbosch algorithm 
for the unweighted maximum clique problem. In addition, Bron Kerbosch using 
w-deg and clique selection outperform Bron Kerbosch using first and random 
selection with respect to solution quality. The solution quality of Bron Kerbosch 
with w-deg and clique selection are comparable. Bron Kerbosch with w-deg se- 
lection, however, is computationally more demanding than Bron Kerbosch with 
clique selection for two reasons: (i) w-deg selection needs significantly more re- 
cursive calls, and (ii) for each recursive call, w-deg selection is computationally 
more expensive than clique selection. These findings make clique selection as our 
first choice for selecting the next pivot vertex. 

Comparing the extended Bron Kerbosch algorithm using clique selection with 
graduated assignment shows that Bron Kerbosch returns significantly better 
results than graduated assignment in less time for letter, grec, and coil. For 
mutagenesis, graduated assignment provides a superior trade-off between speed 
and accuracy. 

5 Conclusion 

The extended Bron Kerbosch algorithm solve the maximum weight clique prob- 
lem, where continuous-valued weights are assigned to both, vertices and edges. 
In doing so, the proposed algorithm is a generic tool for efficiently solving a 
broad range of graph matching problems. 

Further research aims at applying Bron Kerbosch to classification and clus- 
tering problems in the domain of graphs. In addition, we are interested to which 
extent the Cauchy-Schwarz estimate improves Bron Kerbosch using different 
pivoting strategies. 
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